Efficient parallelization for 3d-3v sparse grid Particle-In-Cell: Shared memory architectures

Abstract

Particle-In-Cell (PIC) schemes are among the most broadly used numerical methods in the kinetic simulation of plasmas. The present paper is dedicated to the introduction of novel parallelization strategies, specific to shared memory architectures, tailored for implementing sparse grid reconstructions. These strategies exploit the different parallelisms genuine to Sparse-PIC methods to obtain a speed-up exceeding 100 on 128 cores in a strong scaling study of a 3d-3v simulation. On top of that, substantial gains (two orders of magnitude) are introduced in the sequential as well as the parallel computation of the electric field thanks to a hierarchization procedure. It consists in decomposing the information carried by the grids onto hierarchical basis functions, entailing a significantly reduced number of operations.
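To make the hierarchization idea concrete, the following is a minimal one-dimensional sketch, not the paper's implementation. It assumes a dyadic grid with boundary points included and the standard piecewise-linear (hat) hierarchical basis; the function names `hierarchize_1d` and `dehierarchize_1d` are hypothetical. Each grid value is replaced by its hierarchical surplus, that is, its difference from the linear interpolant of its two coarser-level neighbours.

```python
import numpy as np


def hierarchize_1d(nodal):
    """Nodal values on a dyadic 1D grid -> hierarchical hat-basis surpluses.

    Assumption: len(nodal) == 2**n + 1, boundary points included.
    Boundary entries are left unchanged.
    """
    u = np.asarray(nodal, dtype=float).copy()
    n = u.size - 1  # number of intervals
    if n < 1 or n & (n - 1):
        raise ValueError("grid must have 2**n + 1 points")
    h = 1  # index stride; h = 1 is the finest level
    while h < n:
        # Points at odd multiples of h belong to the current level. Their
        # neighbours at distance h are coarser and still hold nodal values.
        u[h:n:2 * h] -= 0.5 * (u[0:n - h:2 * h] + u[2 * h:n + 1:2 * h])
        h *= 2
    return u


def dehierarchize_1d(surplus):
    """Inverse transform: rebuild nodal values from coarse to fine."""
    u = np.asarray(surplus, dtype=float).copy()
    n = u.size - 1
    if n < 1 or n & (n - 1):
        raise ValueError("grid must have 2**n + 1 points")
    h = n // 2
    while h >= 1:
        u[h:n:2 * h] += 0.5 * (u[0:n - h:2 * h] + u[2 * h:n + 1:2 * h])
        h //= 2
    return u


if __name__ == "__main__":
    x = np.linspace(0.0, 1.0, 9)
    print(hierarchize_1d(4.0 * x * (1.0 - x)))
```

For a smooth function the surpluses shrink quickly with the level (by a factor of four per level for the parabola used above), which is the property sparse grid methods rely on. Both transforms cost a number of operations linear in the number of grid points.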

Similar resources

Efficient Parallelization of Unstructured Reductions on Shared Memory Parallel Architectures

This paper presents a new parallelization method for an efficient implementation of unstructured array reductions on shared memory parallel machines with OpenMP. This method is strongly related to parallelization techniques for irregular reductions on distributed memory machines as employed in the context of High Performance Fortran. By exploiting data locality, synchronization is minimized with...

Comparison of Parallelization Frameworks for Shared Memory Multi-Core Architectures

The development of standard multi-core processors changed in the last years moving from bigger, more complex, and faster cores to putting several more simple cores onto one chip. This changed also the way programs are written in order to leverage the processing power of multiple cores of the same processor. In the beginning, programmers had to divide and distribute the work by hand to the avail...

Efficient Parallelization of Unstructured Reductions on Shared Memory Parallel Architectures

This paper presents a new parallelization method for an efficient implementation of unstructured array reductions on shared memory parallel machines with OpenMP. This method is strongly related to parallelization techniques for irregular reductions on distributed memory machines as employed in the context of High Performance Fortran. By exploiting data locality, synchronization is minimized witho...

Efficient and Scalable Algorithms for Smoothed Particle Hydrodynamics on Hybrid Shared/Distributed-Memory Architectures

This paper describes a new fast and implicitly parallel approach to neighbour-finding in multi-resolution Smoothed Particle Hydrodynamics (SPH) simulations. This new approach is based on hierarchical cell decompositions and sorted interactions, within a task-based formulation. It is shown to be faster than traditional tree-based codes, and to scale better than domain decomposition-based approac...

Shared Memory Parallelization of Data

With the availability of large datasets in application areas like bioinformatics, medical informatics, scientific data analysis, financial analysis, telecommunications, retailing, and marketing, it is becoming increasingly important to execute data mining tasks in parallel. At the same time, technological advances have made shared memory parallel machines commonly available to organizations and...

Journal

Journal title: Journal of Computational Physics

Year: 2023

ISSN: 1090-2716, 0021-9991

DOI: https://doi.org/10.1016/j.jcp.2023.112022